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(1. Construct DNA library 
2. Isolate positive clone 




13. Excise pBK-CMV phagemid 
containing cloned DNA insert 
by co-infection with helper phage 
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DNA sequence of Tm 13.17 cDNA clone 
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1 AGTCX^TCC^UUUSiU^TTC 

MK LLCC'L I S 

61 CCCTCATTCTGTTGGTCACAGTTCAGGCC^^ 

LILLVTVQA j|L TEAQIE KLNK 

121 AGATCAGOUUVAAATGTCAAAATGAAAGTG 

IS-KKCQNESGVSQEI'ITKAR 

181 GCAACGGTGACTGGGAGGACC^TCCT 

N GDWE D D PKLKRQVF CVA R N 

241 ACGCCGGTCTGGCGACGGAATCGGGAGAGGTGGTGGTCGACGTGTTGAGGGAGAAGGTGA 
AG LATE S GEVV V DVL RE K V R 

301 GGAAGGTCACTGACAACGACGAAGAAACTGAGAAAATC^ 

KVTD ND E ETEKI I NKC.A V K R 

361 GAGATACTGTTGAAGAGACGGTGTTCAATAC 

D TVE E TVFN T FKCVMK N K PK 

421 AGTTCTCACCAGTTGATTGAACCACC^ 
F S P V D * 

X 
h 
o 
I 

481 ATATAAAAATAAAGTGTTTCTGATGTAAAAAAAAAAAAAAAAAAAAAAAAAACTCG 
polyadenylation signal poly (A) tail (26) 

537 AGAGTATTCTAGAGCGGCCGCGGGCCCATCGTTTTCCACCC 
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A. Mature Tm 13.17 amino acid residure 



1 LTEAQIEKLN KISKKCQNES GVSQEIITKA RNGDWEDDPK LKRQVFCVAR 
51 NAGLATESGE VWDVLREKV RKVTDNDEET EKIINKCAVK RDTVEETVFN 
101 TFKCVMKNKP KFSPVD 



B. Summary of the composition analysis for the mature Tm 13.17 
sequence: 

Residue Number Mole Percent 





A 
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5.172 
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= Ser 
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4.310 




T- 


= Thr 


8 


6.897 
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=Val 


14 


12.069 
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= Trp 
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0 .862 
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= Tyr 
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0.000 
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= Glx 


0 


0 .000 



Molecular weight = 13171,96; Residues = 116; Average Residue Weight = 113.551 
Charge = 1 ; Isoelectric point = 7.74. 
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FIG 2.7 



Tm 13.17 3 EAQIEKLNKISKKCQNESGVSQEIITKARNGDWEDDPKLKRQVFCVARNA 52 

I-- III-- I- h-lllihl =-l-ll': IMIII =.|h =-l 
AFP-3 1 ETPREKLKQHSDACKAESGVSEESLNKVRNREEVDDPKLKEHAFCILKRA 50 

Tm 13,17 53 GLATESGEVWDVLREKVRKVTDNDEETEKIINKCAVKRDTVEETVFNTF 102 

|: ..|||. :| = = .|. = . .::.|..:.::.||||| = ll-:-- : I 
AFP-3 51 GFIDASGEFQLDHIKTKFKENSEHPEKVDDLVAKCAVKKDTPQHSSADFF 100 

Tm 13.17 103 KCVMKNKP 110 

III -h- 
AFP-3 101 KCVHDNRS 108 



Percent identity: 39.8 (identical amino acids; Percent similarity: 58.3 (identical amino 
acids plus conservative amino acids). 



FIG 2.8 



l £ 5 



Tm 13.17 
AFP- 3 
B Protein 



M K L L 
M K L L 
L 




P! 
w ■ 

WW 



FIG 2.9 



m 

"ST 

3 = H 



Tin 


13. 


17 


NH2- 


■L 


T 


E 


A 


Q 


I 


E 


K 


L 


N 


K 


I 


S 


K 


K C 


Q 


Tm 


12. 


86 


NH2- 


■L 


T 


D 


E 


Q 


I 


Q 


K 


R 


N 


K 


I 


s 


K 


E ? 


Q 



N E 
Q V 



-4 



FIG 2.10 




I 



03 



Q 




Q[EjI 
E D V 



I 


T 


K 


A 


I 


K 


R 


A 











e[e)s L n[k V 28 



Tml3.17 31 
Bl 44 
AFP-3 29- 



R N 

r|k" 


G D 
G D 


R N 


R E 



E D D P K L K 
E D D P K L K 
D D P K L K 




V A R N|TjG L A T 
IFKALEIVA 



ILK r[aJg F I D A 



E S G E 
E S G E 
S G E 



Tml3.17 


61 


V 


V 


V 


D 


V 


L 


Bl 


75 


I 


E 


A 


D 


T 


F 


AFP-3 


59 


F 


Q 


L 


D 


H 


I 



E K 
E K 



V R K 
L T R 



V 


T 


D 


N 


DEE 


T 


E 


K 


I 


V 


T 


N 


D 


DEE 


S 


E 


K 


I 



I N 
V E 



F K E N S E H 



P^eJk V D 



D L V A 



Tml3.17 


91 


R 


D 


T 


V 


E 


E 


T 


V 


F 


N 


T 




K 


C 


V 


M 


K 


N 


K 


P 


K 


F 


S 


P 


V 


D 


Bl 


106 


E 


D 


T 


P 


E 


D 


T 


A 


F 


E 


V 


T 


K 


C 


V 


L 


K 


D 


K 


P 


N 


F 


F 


G 


D 


L 
























































AFP-3 


89 


K 


D 


T 


P 


Q 


H 


S 


S 


A 


D 


F 


F 


K 


c 


V 


H 


D 


N 


R 


S 















30 
43 



60 
74 
58 



K C A V K] 90 

k c[rjvjT 105 

K C A V K 88 



116 
124 
108 



FIG 2.12 



2-2 



1 GGCACGAGCAAA AlA T GlA AACTCCTCTTGTGCTTTGCGTTCGCCGCC 



MKLLLCFAFA A 

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA 
I VI G A Q A T D E Q I Q K 

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC 
RNKI SKECQQVSGVS 

O 137 C A A GA GA CGAT CGA C A A A GT C CGC A C A GGT GT CT T GGT C G A T GA T 
03 QETI DKVRTGVLVDD 



^5 

81 



182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKM-KKHVLCFSKKTG 



m 226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
~" VATEAGDTNVEVLKA 

g 

0 1 271 A A GCT G A A GC A T GT GGCC AGO G ACG A A G A GGT GGA C A A G A T C GT G 
Q KLKHVASDEE VDKI V 

SI 

it 316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
r QKCVVKKAT PEETAY 

361 GACACCTTCAAGTGTATTTACGACAGCAAACCTGATTTCTCTCCT 
DTFK CI YDSKPDFSP 

406 ATTGATTAATTGTTTTGTATTTGACTGAATTTT GAC AAT AAA GGT 
ID* 

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 



poly (A) tail 
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137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 
Q E T.I DKVRTGVLVDD 

S! 182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 

m PKMKKHVLCFSKKTG 

ess 

+! 226 GT GGCAACCGAAGCCGGAGACACCAAT GTGGAGGT ACTCAAAGCC 

yi VATEAGDTNVEVLKA 

53 

m 271 AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG 

£ KLKHVASDEEVDKI V 

X. 3 

O 316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTT AT 

M> QKCVVKKATPEETAY 

361 GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT 
DTFKCI YDSKPDFSP 

406 ATTGATTAATTGTTTTGTATTTGACTGAATTT T G A C A A T A A A G G T 
ID* 

polyadenylation signal 

451 ACTATCGTTATGAAAAAAAAAAAAAAAAAA 
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Reverse Primer 
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percent % composition 
Primer A C G T MeltingTemperature (°C ) 



Forward 2a6 14.3 42.9 14.3 44.0 
Reverse 25.0 31.3 .6.3 37.5 44.0 



FIG 4.6 




FIG 4.7 




FIG 4.8 




FIG 4.9 



3-4 



1 GGCACGAGCAAA AJaTgJa AACTCCTCTTGTGCTTTGCTTTCGCCGCC 

MKLLLCFAFAA 

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA 
I VI G A Q A T D E Q I Q K 

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC 
RNKI SKECQQVSG VS 

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 
QETI DKVRTGVLVDD 

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKMKKHVLCFSKKTG 

226 G T G G C A A CC GAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEVLKA 

271 A A GCT G A A GC AT GT GGCC A GCG A C GA A GA GGT GG A C A A GAT C GT G 
KLKH VASDEEVD KI V 

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVVKKATPEETAY 

361 GACACCTTCAAGGTTATTTACGACAGTAAACCTGATTTCTCTCCT 
D TFKVI YDSKPDFSP 

406 ATTGATTAATTGTTTTGTATTTGACTGAATTT T G A C A A T A A A G G T 
I D * 

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 



poly (A) tail 



FIG. 4.10 a 



Analysis 


Whole Protein 


Molecular Weight 


12839.70 m.w. 


Length 


115 


1 microgram = 


77.883 pMoles 


Molar Extinction coefficient 


2920±5% 


1 A(280) = 


4.40 mg/ml 


Isoelectric Point 


7.14 


Charge at pH 7 


0.16 



Whole Protein Composition Analysis 





Number 


%by 


%by 


Amino Acid(s) 


count 


weight 


frequency 


Charged (RKHYCDE) 


47 


46.41 


40.87 


Acidic (DE) 


20 


18.91 


17.39 


Basic (KR) 


20 


20.41 


17.39 


Polar (NCQSTY) 


29 


24.55 


25.22 


Hydrophobic (AILFWV) 


35 


28.04 


30.43 


A Ala 


6 


3.32 


5.22 


CCys 


3 


2.41 


2.61 


D Asp 


11 


9.86 


9.57 


EGlu 
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9.05 


7.83 


FPhe 
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3.44 


2.61 


GGly 


4 


1.78 


3.48 


H His 


2 


2.14 


1.74 


I He 


6 


5.29 


5.22 


KLys 


18 


17.97 


15.65 


LLeu 


5 


4.41 


4.35 


M Met 


1 


1.02 


0.87 


N Asn 


2 


1.78 


1.74 


P Pro 


4 


3.02 


3.48 


Q Gin 


6 


5.99 


5.22 


R Arg 


2 


2.43 


1,74 


SSer 


7 


4.75 


6.09 


TThr 


9 


7.09 


7.83 


VVai 


15 


11.58 


13.04 


WTrp 


0 


0.00 


0.00 


YTyr 


2 


2.54 


1.74 


B Asx 


0 


0.00 


0.00 


ZGlx 
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0.00 


0.00 


XXxx 
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0.00 


0.00 


.Ter 


0 


0.00 


0.00 



Predicted Amino Acid 



Composition of 3-4 



FIG. 4.10 b 



3-9 



GGCACGAGCAAA A lA T Gl A AACTCCTCTTGTGCTTTGCTTTCGCCGCC 

MKL LLCFAFAA 

ATCGTCATCGGAGCTCAGGCTCTCACCGATGAACAGATACAGAAA 
I VI GAQAALTDEQI QK 



AGGAACAAGATCAGCAAAGAATGCCAGCAGGAGTCCGGAGTGTCC 
RNKI SKECQQESGVS 

CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 
QETI DKVRTGVLVDD 

CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAGAACTGGA 
PKMKKHVLCFSKRTG 

GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEVLKA 

AAGCTGAAGCATGTG GC CA. GCGACGAAGAAGTGGACAAGATCGTG 
KLKHVASDEEVDKI V 

CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVV KKATPEETAY 

GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT 
DTFKVI YDSKPDFSP 

ATTGATTAATTGTTTTGTATTTGACTGAATTT T G A C A A T A A A G G T 
I D * 

polyadenylation signal 

ACTATCGTTATGAAAAAAAAAAAAAAAAAA 

poly (A) tail 



FIG. 4.11 a 



Predicted Amino Acid 
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Analysis 


Whole Protein 


Molecular Weight 


12871.80 m.w. 


Length 


115 


1 microgram = 


77.689 pMoles 


Molar Extinction coefficient 


3040±5% 


1 A(280) = 


4.23 mg/ml 


Isoelectric Point 


7.11 


Charge at pH 7 


0.13 



Whole Protein Composition Analysis 





Number 


%by 
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Amino Acid(s) 


count 


weight 


frequency 


Charged (RKHYCDE) 


48 


47.31 


41.74 


Acidic (DE) 
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18.86 


17.39 


Basic (KR) 
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20.57 


17.39 


Polar (NCQSTY) 
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25.29 


26.09 


Hydrophobic (AILFWV) 
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29.57 


A Ala 
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3.31 


5.22 


CCys 
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D Asp 


11 


9.84 


9.57 
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F Phe 
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H His 
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l lie 
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K Lys 
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Composition of 3-9 



FIG. 4.11 b 



7-5 



1 GGCACGAGCAAA A lAT Gl A AACTCCTCTTGTGCTTTGCGTTCGCCGCC 

MKLLLCFAF AA 

47 ATCGTCATCGGAGCTCAGGCTCTCACCGACGAACAGATACAGAAA 
I VI G A Q A j^L T D E Q i Q K 

92 AGGAACAAGATCAGCAAAGAGTGCCAGCAGGTGTCCGGAGTGTCC 
RNK! SKECQQESGVS 

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGACGAT 
QET I D KVR T GV L V DD 

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKMKKHVLCFSKRTG 

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEVLKA 

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAAGTGGACAAGATCGTG 
KLKHVASDEEVDKI V 

316 CAGAAGTGCGTGGTCAAGAAGGCCACACCAGAGGAAACGGCTTAT 
QKCVVKKAT PE ETAY 

361 GACACCTTCAAGTGTATTTACGACAGTAAACCTGATTTCTCTCCT 
DTFKVI YDSKPDFSP 

406 ATTGATTAATTGTTTTGTATTTGGCTGAATTTT GAC AATAAA GGT 
I D * '■ 

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 



poly (A) tail 



FIG. 4.12 a 



Predicted Amino Acid 



Composition of 7-5 



Analysis 


Whole Protein 


Molecular Weight 


12843.80 m.w. 


Length 


115 


1 microgram = 


77.859 pMoles 


Molar Extinction coefficient 


3040±5% 


1 A(280) = 


4.22 mg/ml 


Isoelectric Point 


7.11 


Charge at pH 7 


0.13 



Whole Protein Composition Analysis 
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D Asp 
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FIG. 4.12 b 
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His-tagged clone 2.2 without signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 
Met Gly Ser Ser His His His His His His Ser 
-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP 
GGA CAG CAA ATG GGT CGC GGA TCC CTC ACC GAC GAA CAG ATA CAG 
Gly Gin Gin Met Gly Arg Gly Ser Leu Thr Asp Glu Gin lie Gin 
-5 15 

AAA AGG AAC AAG ATC AGC AAA GAA TGC CAG CAG GTG TCC GGA GTG 
Lys Arg Asn Lys lie Ser Lys Glu Cys Gin Gin Val Ser Gly Val 
10 15 20 

TCC CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAT 
Ser Gin Glu Thr lie Asp Lys Val Arg Thr Gly Val Leu Val Asp 
25 30 35 

GAT CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT 
Asp Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr 
40 45 50 

GGA GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA 
Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys 
55 60 65 

GCC AAG CTG AAG CAT GTG GCC AGC GAC GAA GAG GTG GAC AAG ATC 
Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys lie 
70 75 80 

GTG CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT 
Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala 
85 90 95 

TAT GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT 
Tyr Asp Thr Phe Lys Cys lie Tyr Asp Ser Lys Pro Asp Phe Ser 
100 105 110 

Stop Codon 

CCT ATT GAT TAA CTCGAGCACC ACCACCACCA CCACTGAGAT 
Pro lie Asp * 
115 
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His-tagged clone 2.3 with signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 
Met Gly Ser Ser His His His His His His Ser 
-55 -50 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-45 -40 -35 

AFP Start Codon 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC GCA CGA GCA AAA ATG 186 
Gly Gin Gin Met Gly Arg Gly Ser Glu Phe Ala Arg Ala Lys Met 
-30 -25 -20 

AAA CTC CTC TTG TGC TTT GCT TTC GCC GCC ATC GTC ATC GGA GCT 231 
Lvs Leu Leu Leu Cvs Phe Ala Phe Ala Ala lie Val lie Glv Ala 
-15 -10 -5 

N- terminal of Mature AFP 

CAG GCT CTC ACC GAC GAA CAG ATA CAG AAA AGG AAC AAG ATC AGC 27 6 

fx*. Gin Ala Leu Thr Asp Glu Gin lie Gin Lys Arg Asn Lys lie Ser 

15 10 

iO 

03 AAA GAA TGC CAG CAG GTG TCC GGA GTG TCC CAA GAG ACG ATC GAC 321 

Si Lys Glu Cys Gin Gin Val Ser Gly Val Ser Gin Glu Thr lie Asp 

^ 15 20 25 

|jj AAA GTC CGC ACA GGT GTC TTG GTC GAT GAT CCC AAA ATG AAG AAG 3 66 

£ Lys Val Arg Thr Gly Val Leu Val Asp Asp Pro Lys Met Lys Lys 

ffl 30 35 40 

3 ^ CAC GTC CTC TGC TTC TCG AAG AAA ACT GGA GTG GCA ACC GAA GCC 411 

Q His Val Leu Cys Phe Ser Lys Lys Thr Gly Val Ala Thr Glu Ala 

fil 45 50 55 

•or 11 

; ?S J GGA GAC ACC AAT GTG GAG GTA CTC AAA GCC AAG CTG AAG CAT GTG 456 

% 'H Gly Asp Thr Asn Val Glu Val Leu Lys Ala Lys Leu Lys His Val 

p 60 65 70 

Mi 

GCC AGC GAC GAA GAA GTG GAC AAG ATC GTG CAG AAG TGC GTG GTC 501 

Ala Ser Asp Glu Glu Val Asp Lys lie Val Gin Lys Cys Val Val 
75 80 85 

AAG AAG GCC ACA CCA GAG GAA ACG GCT TAT GAC ACC TTC AAG TGT 546 
Lys Lys Ala Thr Pro Glu Glu Thr Ala Tyr Asp Thr Phe Lys Cys 
90 95 100 

Stop Codon 

ATT TAC GAC AGT AAA CCT GAT TTC TCT CCT ATT GAT TAA TTGTTTTGTA 59 5 
lie Tyr Asp Ser Lys Pro Asp Phe Ser Pro lie Asp * 
105 110 115 

Polyadenylation signal Poly-A tail 
TTTGACTGAA TTTTGAC AAT AAA GGTACTA TCGTTATGAA AAAAAAAAAA 645 

AAAAAAACTC GAGCACCACC ACCACCACCA CTGAGAT 682 
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His-tagged Clone 2.3 without signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 
Met Gly Ser Ser His His His His His His Ser 
-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N-terminal of mature AFP 
GGA CAG CAA ATG GGT CGC GGA TCC CTC ACC GAC GAA CAG ATA CAG 
Gly Gin Gin Met Gly Arg Gly Ser Leu Thr Asp Glu Gin lie Gin 
-5 1 5 

AAA AGG AAC AAG ATC AGC AAA GAA TGC CAG CAG GTG TCC GGA GTG 
Lys Arg Asn Lys lie Ser Lys Glu Cys Gin Gin Val Ser Gly Val 
10 15 20 

TCC CAA GAG ACG ATC GAC AAA GTC CGC ACA GGT GTC TTG GTC GAT 
Ser Gin Glu Thr lie Asp Lys Val Arg Thr Gly Val Leu Val Asp 
25 30 35 

GAT CCC AAA ATG AAG AAG CAC GTC CTC TGC TTC TCG AAG AAA ACT 
Asp Pro Lys Met Lys Lys His Val Leu Cys Phe Ser Lys Lys Thr 
40 45 50 

GGA GTG GCA ACC GAA GCC GGA GAC ACC AAT GTG GAG GTA CTC AAA 
Gly Val Ala Thr Glu Ala Gly Asp Thr Asn Val Glu Val Leu Lys 
55 60 65 

GCC AAG CTG AAG CAT GTG GCC AGC GAC GAA GAA GTG GAC AAG ATC 
Ala Lys Leu Lys His Val Ala Ser Asp Glu Glu Val Asp Lys lie 
70 75 80 

GTG CAG AAG TGC GTG GTC AAG AAG GCC ACA CCA GAG GAA ACG GCT 
Val Gin Lys Cys Val Val Lys Lys Ala Thr Pro Glu Glu Thr Ala 
85 90 95 

TAT GAC ACC TTC AAG TGT ATT TAC GAC AGT AAA CCT GAT TTC TCT 
Tyr Asp Thr Phe Lys Cys lie Tyr Asp Ser Lys Pro Asp Phe Ser 
100 105 110 

Stop Codon 

CCT ATT GAT TAA CTCGAGCACC ACCACCACCA CCACTGAGAT 
Pro lie Asp * 
115 
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His-tagged Tin 13.17 with signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 50 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 96 
Met Gly Ser Ser His His His His His His Ser 
-65 -60 -55 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 141 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-50 -45 -40 

GGA CAG CAA ATG GGT CGC GGA TCC GAA TTC TGG ATC CAA AGA ATT 186 
Gly Gin Gin Met Gly Arg Gly Ser Glu Phe Trp lie Gin Arg lie 
-35 -30 -25 

AFP Start Codon 

CGG CAC GAG ACT ACT AAG ATG AAG TTG CTC TGT TGT CTA ATC TCC 231 

Arg His Glu Thr Thr Lys Met Lvs Leu Leu Cvs Cvs Leu lie Ser 

-20 -15 -10 

N- terminal of mature AFP 
CTC ATT CTG TTG GTC ACA GTT CAG GCC CTG ACC GAG GCA CAA ATT 27 6 

Leu He Leu Leu Val Thr Val Gin Ala Leu Thr Glu Ala Gin He 
-5 15 

GAG AAA CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA 321 
B Glu Lys Leu Asn Lys He Ser Lys Lys Cys Gin Asn Glu Ser Gly 

10 15 20 

GTG TCG CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG 366 
Val Ser Gin Glu He He Thr Lys Ala Arg Asn Gly Asp Trp Glu 
25 30 35 

GAC GAT CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC 411 
Asp Asp Pro Lys Leu Lys Arg Gin Val Phe Cys Val Ala Arg Asn 
40 45 50 

GCC GGT CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG 456 
Ala Gly Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu 
55 60 65 

AGG GAG AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG 501 
Arg Glu Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu 
70 75 80 

AAA ATC ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG 546 
Lys He He Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu 
85 90 95 

ACG GTG TTC AAT ACT TTC AAA TGT GTC ATG AAA AAC AAG CCA AAG 595 
Thr Val Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys 
100 105 110 

Stop Codon 

TTC TCA CCA GTT GAT TGA ACCACCACGA CTAGTAGATG GTTCAAATGG 643 
Phe Ser Pro Val Asp * 
115 

Polyadenylation signal Poly-A tail 
TGTGCTTTAC ATATAA AAAT AAA GTGTTTC TGATGTAAAA AAAAAAAAAA 693 

AAAAAAAAAA AACTCGAGAG TATTCTAGAG CGGCCGCGGG CCCATCGTTT 743 

TCCACCCCTC GAGCACCACC ACCACCACCA CTGAGAT 777 
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His-tagged Tm 13.17 without signal sequence 



TTGTTAGCGG ATGGAATTCC CTCGTAGGGG ATAATTTTGT TTACTTTAAG 

His-tag Start Codon 
AAGGAGATAT ACC ATG GGC AGC AGC CAT CAT CAT CAT CAT CAC AGC 
Met Gly Ser Ser His His His His His His Ser 
-30 -25 

AGC GGC CTG GTG CCG CGC GGC AGC CAT ATG GCT AGC ATG ACT GGT 
Ser Gly Leu Val Pro Arg Gly Ser His Met Ala Ser Met Thr Gly 
-20 -15 -10 

N- terminal of mature AFP 
GGA CAG CAA ATG GGT CGC GGC CTG ACC GAG GCA CAA ATT GAG AAA 
Gly Gin Gin Met Gly Arg Gly Leu Thr Glu Ala Gin lie Glu Lys 
-5 15 

CTG AAC AAG ATC AGC AAA AAA TGT CAA AAT GAA AGT GGA GTG TCG 
Leu Asn Lys lie Ser Lys Lys Cys Gin Asn Glu Ser Gly Val Ser 
10 15 20 

CAA GAG ATC ATA ACC AAA GCT CGC AAC GGT GAC TGG GAG GAC GAT 
Gin Glu lie lie Thr Lys Ala Arg Asn Gly Asp Trp Glu Asp Asp 
25 30 35 

CCT AAA CTG AAA CGC CAA GTT TTT TGC GTG GCC AGG AAC GCC GGT 
Pro Lys Leu Lys Arg Gin Val Phe Cys Val Ala Arg Asn Ala Gly 
40 45 50 

CTG GCC ACG GAA TCG GGA GAG GTG GTG GTC GAC GTG TTG AGG GAG 
Leu Ala Thr Glu Ser Gly Glu Val Val Val Asp Val Leu Arg Glu 
55 60 65 

AAG GTG AGG AAG GTC ACT GAC AAC GAC GAA GAA ACT GAG AAA ATC 
Lys Val Arg Lys Val Thr Asp Asn Asp Glu Glu Thr Glu Lys lie 
70 75 80 

ATC AAT AAG TGC GCC GTC AAG AGA GAT ACT GTT GAA GAG ACG GTG 
lie Asn Lys Cys Ala Val Lys Arg Asp Thr Val Glu Glu Thr Val 
85 90 95 

TTC AAT ACT TTC AAA TGT GTC ATG AAA AAC AAG CCA AAG TTC TCA 
Phe Asn Thr Phe Lys Cys Val Met Lys Asn Lys Pro Lys Phe Ser 
100 105 110 

Stop Codon 

CCA GTT GAT TGA CTCGAGCACC ACCACCACCA CCACTGAGAT 
Pro Val Asp * 
115 
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1 AGTGGATCCAAAGAATTCGGCACGAGACTACTAA(^l3AAGTi?GCTCTGTTGTCTAATCT 

M K L L C C'L'I S 

61 CCCTCATTCTGTTGGTCACAGTO 

LILLVTV OA ^L TEAQIE KLNK 

121 AGATCAGCAAAAAATGTCAAAATGAAAGTGGAGTGTCGCAAGAGATCATAACCAAAGCTC 
I S.KKCQNESGVSQE I ITK AR 

181 GCAACGGTGACTGGGAGGACGATCCTAAACTGAAACGCCAAGTTTTTTGCGTGGCCAGGA 
NGDWE DD PKLKRQVF CVA R N 

241 ACGCCGGTCTGGCC^CGGAATCGGGAGAGGTGGTGGTCGACGTGTTGAGGGAGAAGGTGA 
AG LATE S GEVVV DVL RE K V R 

301 GGAAGGTCACTGACAACGACGAAGAAACTGAGAAAATG 

KVTDNDEETEKIINKCAVKR 

361 GAGATACTGTT.G^AGAGACGGTGTTCAATACTTTCAAATGTGTCATGAAAAACAAGCCAA 
DTVEETVFNTFKCVMKNK P K 

42 1 AGTT(£ j^CCAGTTGAT^^ CCACCACGACTAGTAG^TGGTTCAAATGGTGTGCTTTAC 
F S P V D * 

X 
h 
o 
I 

481 ATATAAAAATAAAGTGTTTCTGATGTAAAAAAAAAAAAAAAAAAAAAAAAAACTC 
polyadenylation signal poly (A) tail (26) 

537 AGAGTATTCTAGAGCGGCCGCGGGCCCATCGTTTTCCACCC 
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1 GGCACGAGCAAA A |A T G| A A ACTCCT CT T 6T GCT T T GCGTT C6CCGCC 



MKLLLCFAFAA 

47 ATCGTCATOGGAGCTCAGGCT ^J^KQ^QAS^XKSA^ TACAGAAA 
I VI GAGA^LTDEQI QK 

92 AGGAACAAGATCAGCAAAGAATGCCAGCAGGTGTCCGGAGTGTCC 
RNKI SKECQGVSGVS 

137 CAAGAGACGATCGACAAAGTCCGCACAGGTGTCTTGGTCGATGAT 
QETI DKVRTGVLVDD 

182 CCCAAAATGAAGAAGCACGTCCTCTGCTTCTCGAAGAAAACTGGA 
PKMKKHVLCFSKKTG 

226 GTGGCAACCGAAGCCGGAGACACCAATGTGGAGGTACTCAAAGCC 
VATEAGDTNVEV L K A 

271 AAGCTGAAGCATGTGGCCAGCGACGAAGAGGTGGACAAGATCGTG 
KLKHVASDEEVDKI V 

316 CAGAAGT GCGT GGT CAAGAAGGCCACACCAGAGGAAACGGCTT AT 
QKCVVKKATP EETAY 

361 GACACCTTCAAGTGTATTTACGACAGCAAACCTGATTT £X£ "GT GJl) 
DTFK CI YDSKPDFSP 

406 AX.J GA'J T A A^T TGTTTTGTATTTGACTGAATTTTGACAATAAAGGT 
, D * 

polyadenylation signal 

451 ACTATCGTTATGTAAAAAAAAAAAAAAAAA 
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